No-Regret Learning in Repeated Bayesian Games
نویسندگان
چکیده
Recent price-of-anarchy analyses of games of complete information suggest that coarse correlated equilibria, which characterize outcomes resulting from no-regret learning dynamics, have near-optimal welfare. This work provides two main technical results that lift this conclusion to games of incomplete information, a.k.a., Bayesian games. First, near-optimal welfare in Bayesian games follows directly from the smoothness-based proof of near-optimal welfare in the same game when the private information is public. Second, no-regret learning dynamics converge to Bayesian coarse correlated equilibrium in these incomplete information games. These results are enabled by interpretation of a Bayesian game as a stochastic game of complete information.
منابع مشابه
No-Regret Learning in Bayesian Games
Recent price-of-anarchy analyses of games of complete information suggest that coarse correlated equilibria, which characterize outcomes resulting from no-regret learning dynamics, have near-optimal welfare. This work provides two main technical results that lift this conclusion to games of incomplete information, a.k.a., Bayesian games. First, near-optimal welfare in Bayesian games follows dir...
متن کاملPre-Bayesian Games
Work on modelling uncertainty in game theory and economics almost always uses Bayesian assumptions. On the other hand, work in computer science frequently uses non-Bayesian assumptions and appeal to forms of worst case analysis. In this talk we deal with Pre-Bayesian games, games with incomplete information but with no probabilistic assumptions about the environment. We first discuss safety-lev...
متن کاملOn No-Regret Learning, Fictitious Play, and Nash Equilibrium
This paper addresses the question what is the outcome of multi-agent learning via no-regret algorithms in repeated games? Speci cally, can the outcome of no-regret learning be characterized by traditional game-theoretic solution concepts, such as Nash equilibrium? The conclusion of this study is that no-regret learning is reminiscent of ctitious play: play converges to Nash equilibrium in domin...
متن کاملOnline Learning with Variable Stage Duration
We consider online learning in repeated decision problems, within the framework of a repeated game against an arbitrary opponent. For repeated matrix games, well known results establish the existence of no-regret strategies; such strategies secure a long-term average payoff that comes close to the maximal payoff that could be obtained, in hindsight, by playing any fixed action against the obser...
متن کاملConvex Repeated Games and Fenchel Duality
We describe and analyze an algorithmic framework for playing convex repeatedgames. In each trial of the repeated game, the first player predicts a vector andthen the second player responds with a loss function over the vector. Based on ageneralization of Fenchel duality, we derive an algorithmic framework for the firstplayer and analyze the player’s regret. We then use our a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1507.00418 شماره
صفحات -
تاریخ انتشار 2015